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Summary. This chapter deals with the exact enumeration of certain classes of self- 
avoiding polygons and polyominoes on the square lattice. We present three general 
approaches that apply to many classes of polyominoes. The common principle to all 
of them is a recursive description of the polyominoes which then translates into a 
functional equation satisfied by the generating function. The first approach applies 
to classes of polyominoes having a linear recursive structure and results in a rational 
generating function. The second approach applies to classes of polyominoes having 
an algebraic recursive structure and results in an algebraic generating function. The 
third approach, commonly called the Temperley method, is based on the action of 
adding a new column to the polyominoes. We conclude by discussing some open 
questions. 



1 Introduction 

1.1 Subclasses of polygons and polyominoes 

This chapter deals with the exact enumeration of certain classes of (self- 
avoiding) polygons and polyominoes. We restrict our attention to the square 
lattice. As the interior of a polygon is a polyomino, we often consider polygons 
as special polyominoes. The usual enumeration parameters are the area (the 
number of cells) and the perimeter (the length of the border). The perimeter 
is always even, and often refined into the horizontal and vertical perimeters 
(number of horizontal/ vertical steps in the border). Given a class C of polyomi- 
noes, the objective is to determine the following complete generating function 
of C: 

C(x,y,q) = ]T xMP)/2^P(P)/2 9 a(P) ; 
Pec 
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where hp{P), vp(P) and a(P) respectively denote the horizontal perimeter, 
the vertical perimeter and the area of P. This means that the coefficient 
c(m,n,k) of x m y n q k in the series C(x,y,q) is the number of polyominoes in 
the class C having horizontal perimeter 2m, vertical perimeter 2n and area k. 
Several specializations of C(x, y, q) may be of interest, such as the perimeter 
generating function C(t,t,l), its anisotropic version C(x, y, 1), or the area 
generating function (7(1, 1, q). From such exact results, one can usually derive 
many of the asymptotic properties of the polyominoes of C: for instance the 
asymptotic number of polyominoes of perimeter n, or the (asymptotic) average 
area of these polyominoes, or even the limiting distribution of this 3X6 EL, 3-S 
n tends to infinity (see Chapter 11). The techniques that are used to derive 
asymptotic results from exact ones are often based on complex analysis. A 
remarkable survey of these techniques is provided by Flajolet and Sedgewick's 
book [33]. 

The study of sub-classes of polyominoes is natural, given the immense dif- 
ficulty of the full problem (enumerate all polygons or all polyominoes). The 
objective is to develop new techniques, and to push further and further the 
border between solved and unsolved models. However, several classes have an 
independent interest, other than being an approximation of the full problem. 
For instance, the enumeration of partitions (Fig. 2(e)) is relevant in number 
theory and in the study of the representations of the symmetric group. The 
first enumerative results on partitions date back, at least, to Euler. A full 
book is devoted to them, and is completely independent of the enumeration 
of general polyominoes [2] . Another example is provided by directed polyomi- 
noes, which are relevant for directed percolation, but also occur in theoretical 
computer science as binary search networks [55]. 

All these classes will be systematically defined in Section 1.3. For the mo- 
ment, let us just say that most of them are obtained by combining conditions 
of convexity and directedness. 

From the perspective of subclasses as an approximation to the full prob- 
lem, it is natural to ask how good this approximation is expected to be. The 
answer is quite crude: these approximations are terrible. For a start, all the 
classes that have been counted so far are exponentially small in the class of all 
polygons (or polyominoes). Hence we cannot expect their properties to reflect 
faithfully those of general polygons/polyominoes. Why would the properties 
of a staircase polygon (Fig. 2(b)) be similar to those of a general self-avoiding 
polygon? Indeed, the number of staircase polygons of perimeter 2n grows like 
2 2n n -3 / 2 (up to a multiplicative constant), while the number of general poly- 
gons is believed to be asymptotically /i 2n n -5 / 2 , with fi = 2.638 . . . [36]. The 
average width of a staircase polygon is clearly linear in n. while the width 
of general polygons is conjectured to grow like n 3 / 4 (see [42]). And so on! In 
this context, it may be a pure coincidence that the average area of polygons 
of perimeter 2n is conjectured to scale as ?i 3 / 2 (see [27]), just as it does for 
staircase polygons. But it is also conjectured that the limit distribution of 
the area of 2n-step polygons (normalized by its average value) coincides with 
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the corresponding distribution for staircase polygons, and for other exactly 
solved classes. The universality of this distribution may not be a coincidence 
(see Chapter 11 or [50] for more references and details). 

1.2 Three general approaches 

In this chapter, we present three robust approaches that can be applied to 
count many classes C of polyominoes. The common principle of all of them is 
to translate a recursive description of the polyominoes of C into a functional 
equation satisfied by the generating function C(x,y,q). Some readers may 
prefer seeing a translation in terms of the coefficients of C(x, y, q), namely the 
numbers c(m, n, k). This translation is possible, but it is usually easier to work 
with a functional equation than with a recurrence relation. The applicability 
of each of these three approaches depends on whether the polyominoes of C 
have, or don't have, a certain type of recursive structure. 

The most versatile approach is probably the third one, as it virtually ap- 
plies to any class of polyominoes having a convexity property. It was already 
used by Temperley in 1956 [52] and is often called, in the physics literature, 
the Temperley approach. However, it often produces functional equations that 
are non-trivial to solve, even when the solution finally turns out to be a simple 
rational or algebraic series (these terms will be defined in Section 1.3 below). 
From a combinatorics point of view, it is important to get a better understand- 
ing of the simplicity of these series, and this is what the first two approaches 
provide: the first one applies to classes C having a linear structure, and gives 
rise to rational generating functions. The second applies to classes having an 
algebraic structure, and gives rise to algebraic generating functions. 

We have chosen to present these three approaches because, in our opinion, 
they are the most robust ones, and we want to provide effective tools to the 
reader. To our knowledge, almost all the classes that have been solved exactly 
can be solved using one (or several) of these approaches. Still, certain results 
have been given a beautiful combinatorial explanation via more specific tech- 
niques. Let us mention two tools that are often involved in those alternative 
approaches. The first tool is specific to the enumeration of polygons, and con- 
sists in studying classes of possibly self-intersecting polygons, and then using 
an inclusion-exclusion principle to eliminate the ones with self-intersections. 
This idea appears in an old paper of Polya [43] dealing with staircase polygons, 
and was further exploited to count more general polygons [30, 31], including 
in dimensions larger than two [35, 13]. The second tool is the use of bijections 
and is of course not specific to polyomino enumeration. The idea is to describe 
a one-to-one correspondence between the objects of C and those of another 
class T>, having a simpler recursive structure. In this chapter, even though we 
often use encodings of polyominoes by words, these encodings are usually very 
simple and do not use the full force of bijective methods, which is clearly at 
work in papers like [16] or [20]. 
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The structure of the chapter is simple: the three approaches we discuss 
are presented, and illustrated by examples, in Sections 2, 3 and 4 respectively. 
A few open problems which we consider worth investigating are discussed in 
Section 5. 

We conclude this introduction with definitions of various families of poly- 
ominoes and formal power scries. 

1.3 A visit to the zoo 

All the classes studied in this chapter are obtained by combining several con- 
ditions of convexity and directedness. Let us first recall that a polyomino P 
is a finite set of square cells of the square lattice whose interior is connected. 
The set of centres of the cells form an animal A (Fig. 1). The connectivity 
condition means that any two points of A can be joined by a path made up 
of unit vertical and horizontal steps, in such a way that every vertex of the 
path lies in A. The animal A is North-East directed (or directed, for short) if it 
contains a point i>o, called the source, such that every other point of A can be 
reached from vq by a path made of North and East unit steps, having all its 
vertices in A. In this case, the polyomino corresponding to A is also said to be 
NE-directed. One defines NW, SW and SE directed animals and polyominocs 
similarly. 

A polyomino P is column- convex if its intersection with every vertical line 
is connected. This means that the intersection of every vertical line with the 
corresponding animal A is formed by consecutive points. The border of P is 
then a polygon. Row-convexity is defined similarly. Finally, P is <i+-convex 
if the intersection of A with every line of slope 1 is formed by consecutive 
points. One defines d_ -convex polyominoes similarly. 




(a) (b) (c) (d) 



Fig. 1. From left to right: (a) a polyomino and the corresponding animal, (b) a 
NE-directed animal, (c) a column-convex polygon, (d) a d_-convex polyomino. 

As discussed in [8] , the combination of the four direction conditions and the 
four connectivity conditions gives rise to 31 distinct (non-symmetric) classes 
of polyominoes having at least one convexity property. To these 31 classes we 
must add the 4 different classes satisfying at least one directional property. 
Some prominent members of this zoo, which will occur in the forthcoming 
sections, are shown in Fig. 2: 
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• convex polyominoes (or polygons): polyominoes that are both column- and 
row-convex, 

• staircase polyominoes (or polygons): convex polygons that are NE- and 
SW-directed, 

• bargraphs: column-convex polygons that are NE- and NW-dirccted, 

• stacks: row-convex bargraphs, 

• partitions, a.k.a. Ferrers diagrams: convex polygons that are NE-, NW- 
and SE-directed. 

Finally, a formal power scries C(x) = C(xi, . . . , Xk) with real coefficients is 
rational if it can be written as a ratio of polynomials in the x^s. It is algebraic 
if it satisfies a non-trivial polynomial equation 

P(C(x),x 1 ,...,x k )=0. 
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Fig. 2. A photo taken at the zoo: (a) a convex polygon, (b) a staircase polygon, (c) 
a bargraph, (d) a stack, (e) a Ferrers diagram. 



2 Linear models and rational series 

2.1 A basic example: bargraphs counted by area 

Let b n denote the number of bargraphs of area n. As there is a unique bargraph 
of area 1, b\ = 1. For n > 2, there are two types of bargraphs: 

1. those in which the last (i.e., rightmost) column has height 1, 

2. those in which the last column has height 2 or more. 

Bargraphs of the first type are obtained by adding a column of height 1 to the 
right of any bargraph of area n — 1. Bargraphs of the second type are obtained 
by adding one square cell to the top of the last column of a bargraph of area 
n— 1. Since a bargraph cannot be simultaneously of type 1 and 2, this gives 

bi = 1 and for n > 2, b n = 2b n -\, 

which implies h 
rational series: 



2™ . The area generating function of bargraphs is thus a 

<1 



B(q) :=J2b n q n = T 



2q 

n>l y 
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2.2 Linear objects 

The above enumeration of bargraphs is based on a very simple recursive de- 
scription of bargraphs. This description only involves the following two con- 
structions: 

1. taking disjoint unions of sets, 

2. concatenating a new cell to an already constructed object. 

In terms of generating functions (g.f.s), taking the disjoint union of sets means 
summing their g.f.s. while concatenating a new cell (of size 1) to all elements of 
a set means multiplying its g.f. by q. Hence the above description of bargraphs 
translates directly into a linear equation for the g.f. B(q): 

B(q)=q + qB(q)+qB(q). 

This equation reflects the fact that the set of bargraphs is the union of three 
disjoint subsets (the unique bargraph of area 1, bargraphs of type 1, bargraphs 
of type 2), and that the second and third subsets are both obtained by adding 
a cell to any bargraph. 

More generally, we will say that a class of objects, equipped with a size, is 
linear if these objects can be obtained from a finite set of initial objects using 
disjoint union and concatenation of one cell, or atom. It is assumed that the 
concatenation of an atom increases the size by 1. The construction must be 
non- ambiguous, meaning that each object of the class is obtained only once. 
The construction may involve several classes of objects simultaneously. For 
instance, the class B of bargraphs whose last column has height 1 is linear: 
the objects of B, other than the one-cell bargraph, are obtained by adding one 
cell to the right of any bargraph. The associated series B{q) is defined by the 
linear system: 

( B(q) =q + qB(q), 
\B(q)=q + qB(q)+qB(q). 

In general, the generating function of a linear class of objects is the first 
component of the solution of a system of k linear equations of the form 

k 

B l (q) = Pi(q) + q^aijBjiq) 1 < i < k, (1) 
J'=l 

where Ojj € N and each Pi{q) is a polynomial in q with coefficients in N. The 
polynomial Pi{q) counts the initial objects of type i, and there are a^j ways 
to aggregate an atom to an object of type j to form an object of type i. The 
system (1) uniquely defines each series Bi(q), which is rational. The series 
obtained in this way are called N-rational. Their study is closely related to 
the theory of regular languages [51]. 
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2.3 More linear models 

In this section we present three typical problems that can be solved via a linear 
recursive description. The first one is the perimeter enumeration of Ferrers 
diagrams (and stacks). The second one generalizes the study of bargraphs 
performed in Section 2.1 to all column-convex polygons (and to the subclass 
of directed column-convex polygons) counted by area. The third one illustrates 
the role of linear models in the approximation of hard problems, and deals 
with the enumeration of self-avoiding polygons confined to a narrow strip. In 
passing, we illustrate the two following facts: 

1. it may be useful to begin by describing a size preserving bijection between 
polyominoes and other objects (having a linear structure), 

2. linear constructions are conveniently described by a directed graph when 
they become a bit involved. 

Ferrers diagrams by perimeter 

The set of Ferrers diagrams can be partitioned into three disjoint subsets: first, 
the unique diagram of (half-)perimeter 2; then, diagrams of width at least 2 
whose rightmost column has height 1; finally, diagrams with no column of 
height 1 . The latter diagrams can be obtained by duplicating the bottom row 
of another diagram (Fig. 3). 




Fig. 3. Recursive description of Ferrers diagrams. 

From this description, it follows that the set of words that describe the 
North-East boundary of Ferrers diagrams, from the NW corner to the SE one, 
admits a linear construction. This boundary is formed by East and South 
steps, and will be encoded by a word over the alphabet {e, s}. Any word 
over this alphabet that starts with an e and ends with an s corresponds to 
a unique Ferrers diagram. Let T be this class of words, and let C be the set 
of all non-empty prefixes of words of J-. Then T and C admit the following 
linear description: 

T = Cs and C = {e} U Ce U Cs. 

In these equations, the notation Cs means {us, u € £}, and the unions are 
disjoint. The series that count the words of these sets by their length (number 
of letters) are thus given by the linear system 
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F(t)=tL(t) and L(t) = t + 2tL(t). 

Since the length of a coding word is the half-perimeter of the associated dia- 
gram, this provides the length g.f.: 



F(t) 



t- 



1 - 2t 



2"~ 2 f 



By separately counting East and South steps, we obtain the equations 

F(x,y) = yL(x,y) and L(x, y) = x + xL(x, y) + yL(x,y), 
and hence the anisotropic perimeter g.f. of these diagrams: 

xy ( m + n — 2^ 



(2) 



F(x,y) 



1-x-y 



E 

m,n>l 



m — 1 



x m y n . 



A similar treatment can be used to determine the perimeter g.f. of stack 
polygons: the construction schematized in Fig. 4 gives: 

S(x,y) = xy + xS(x,y) + S+(x,y), S+{x,y) = yS(x,y) +xS + (x,y) 

which yields 



S{x,y) = 



xy(l — x) 
(1 - xf 



y 






Fig. 4. Recursive description of stack polygons. 



Column-convex polygons by area 

Consider a column-convex polygon P having n cells. Let us number these 
cells from 1 to n as illustrated in Fig. 5. The columns are visited from left to 
right. In the first column, cells are numbered from bottom to top. In each of 
the other columns, the lowest cell that has a left neighbour gets the smallest 
number; then the cells lying below it are numbered from top to bottom, and 
finally the cells lying above it are numbered from bottom to top. Note that for 
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Fig. 5. A column-convex polygon, with the numbering and encoding of the cells. 



all i, the cells labelled 1, 2, . . . , i form a column-convex polygon. This labelling 
describes the order in which we are going to aggregate the cells. 

Associate with P the word u = u\ ■ ■ ■ u n over the alphabet {a, b, c} defined 

by 

Ui = c (like Column) if the i th cell is the first to be visited in its column. 

- m = b (like Below) if the i th cell lies below the first visited cell of its 
column, 

- Ui — a (like Above) if the i th cell lies above the first visited cell of its 
column. 

Then, add a bar on the letter Ui if the i th cell of P has a South neighbour, 
an East neighbour, but no South-East neighbour. (In other words, the barred 
letters indicate where to start a new column, when the bottommost cell in 
this new column lies above the bottommost cell of the previous column.) 
This gives a word v over the alphabet {a, 6, c, a, 6, c}, and P can be uniquely 
reconstructed from v. 

We now focus on the enumeration of these coding words. Let C be the set 
of all prefixes of these words, including the empty prefix e. By considering 
which letter can be added to the right of which prefix, we are led to partition 
C into five disjoint subsets C\, . . . ,£5, subject to the following linear recursive 
description: 

Ci={e}, 

£2 = C\c U £20 U £3(1 U £40, £4 = £20 U £30 U £40 U £56, (3) 
£ 3 = £ 2 c U £36 U £ 3 c, £ 5 = £ 2 c U £36 U £ 3 c U £56. 

The words of £4 and £5 are those in which a barred letter (the rightmost one) 
still waits to be "matched" by a letter c creating a new column. The words 
of £2 U £3 are those that encode column-convex polygons. This construction 
is illustrated by a directed graph in Fig. 6: every path starting from 1 and 
ending at i corresponds to a word of d, obtained by reading edge labels. The 
series counting the words of Li by their length satisfy: 

In = 1, 

L 2 = q {L\ + L 2 + L 3 + £4) ) Li = q (L 2 + L 3 + L 4 + L 5 ) , 
L3 = q {Li + L 3 +L 3 ), L 5 = q (L 2 + L 3 + L 3 + L 5 ) . 

The area g.f. of column-convex polygons is C(q) = L 2 {q) + L 3 (q). Solving the 
above system gives: 
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Fig. 6. Linear construction of the words of L. The words of Li encode the paths 
starting from 1 and ending at i. 



C(a) = iilzi) 

W l-5q + 7q 2 -4(7 3 ' 

We believe that this result was first published by Temperley [52] . 

A column-convex polygon is directed if and only if its coding word does 
not use the letter b. We obtain a linear description of the prefixes of these 
words by deleting all terms of the form Lib in the description (3). The class 
£5 becomes irrelevant. Solving the associated system of linear equations gives 
the area g.f. of directed column-convex polygons: 

1 -3q + q A 

As far as we know, this result was first published by Klarner [38]. 



Polygons confined to a strip 

Constraining polyominoes or polygons to lie in a strip of fixed height endows 
them with a linear structure. This observation gives a handle to attack diffi- 
cult problems, like the enumeration of general self-avoiding polygons (SAP), 
self-avoiding walks, or polyominoes [1, 5, 48, 56, 57]. As the size of the strip 
increases, the approximation of the confined problem to the general one be- 
comes better and better. This widely applied principle gives, for instance, 
lower bounds on growth constants that are difficult to determine. We illus- 
trate it here with the perimeter enumeration of SAP confined to a strip. 

Before we describe this calculation, let us mention a closely related idea, 
which consists of considering anisotropic models (for instance, SAP counted 
by vertical and horizontal perimeters), and fixing the number of atoms lying 
in one direction, for instance the number of horizontal edges. Again, this 
endows the constrained objects with a linear structure. The denominators 
of the rational generating functions that count them often factor in terms 
(1 — y l ). The number of exponents i that occur can be seen as a measure of 
the complexity of the class. This is often observed only at an experimental 
level. However, this observation has been pushed in some cases to a proof that 
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the corresponding generating function is not D-finite, and in particular not 
algebraic (see for instance [49], and Chapter 5). 

But let us return to SAP in a strip of height k (a k-strip). A first ob- 
servation is that a polygon is completely determined by the position of its 
horizontal edges. Consider the intersection of the polygon with a vertical line 
lying at a half-integer abscissa (a cut): the strip constraint implies that only 
finitely many configurations (or states) can occur. The number of such states 
is the number of even subsets of {0, 1, . . . , k}. This implies that SAP in a strip 
can be encoded by a word over a finite alphabet. For instance, the polygon of 
Fig. 7 is encoded by the word bbbaabaaba. 



abb 

Fig. 7. A self-avoiding polygon in a strip of height 2, encoded over a 3-letter alpha- 
bet. 

It is not hard to see that for all k, the set of words encoding SAP confined 
to a strip of height k has a linear structure. To make this structure clearer, 
we refine our encoding: for every vertical cut, we not only keep track of its 
intersection with the polygon, but also of the way the horizontal edges that 
meet the cut are connected to the left of the cut. This does not change the 
size of the alphabet for k = 2, as there is a unique way of coupling two edges. 
However, if k = 3, the configuration where 4 edges are met by the cut gives 
rise to 2 states, depending on how these 4 edges are connected (Fig. 8). The 
number of states is now the number of non-crossing couplings on {0, 1, . . . , k}. 
This is also the size of our encoding alphabet A. 

abb c c d e f 

Fig. 8. A self-avoiding polygon in a strip of height 3, encoded by the word 
dbafcceaafcaceeabcc. 

Fix k, and let S be the set of words encoding SAP confined to a fc-strip. 
The set C of prefixes of words of S describes incomplete SAP, and has a simple 
linear structure: for every such prefix w, the set of letters a such that wa lies 
in £ only depends on the last letter of w. In other words, these prefixes are 
Markovian with memory 1. For every letter a in the encoding alphabet, we 
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denote by C a the set of prefixes ending with the letter a. The linear structure 
can be encoded by a graph, from which the equations defining the sets C a can 
automatically be written. This graph is shown in Fig. 9 (left) for k = 2. Every 
path in this graph starting from the initial vertex corresponds to a word of 
C, obtained by reading vertex labels. The linear structure of prefixes reads: 

C a = (c + C a + C b + C b )a, C b ={e + L a +C b )b, C b = (e + C a + C b )b. 

From this we derive linear equations for incomplete SAP, where every hori- 
zontal edge is counted by y/x, and every vertical edge by y/y = z: 

L a = (z 2 + L a + zL b + zL~ b )x, L b = (z + zL a + L b )x, L b = (z + zL a + L b )x. 

These equations keep track of how many edges are added when a new letter 
is appended to a word of C. They can be schematized by a weighted graph 
(Fig. 9, middle). Now the (multiplicative) weight of a path starting at is 
the weight of the corresponding incomplete polygon. Finally, the completed 
polygons are obtained by adding vertical edges to the right of incomplete 
polygons. This gives the generating function of SAP in a strip of height 2 as: 

S 2 (x, y) = z 2 L a + zL b + zL b . 




Fig. 9. The linear structure of SAP in a 2-strip. 



Clearly, we should exploit the horizontal symmetry of the model to obtain 
a smaller set of equations. The letters b and b playing symmetric roles, we 
replace them in the graph of Fig. 9 by a unique vertex j3, such that the 
generating function of paths ending at (3 is the sum of the g.f.s of paths 
ending at b and b in the first version of the graph (Fig. 9, right). Introducing 
the series Lp = L b + L b , we have thus replaced the previous system of four 
equations by 



L a = x(z 2 + L a + zLp), Lp = x(2z + 2zL a + L p ), S 2 {x,y) = z 2 L a + zLp, 
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from which we obtain 

xy(2 -2x + y + 3xy) 
S ^ V) = (l-x) 2 -2x 2 y ■ 

Note that this series counts polygons of height 1 twice, so that we should 
subtract S\(x, y) = xy/(l — x) to obtain the g.f. of SAP of height at most 2, 
defined up to translation. 




Fig. 10. The linear structure of SAP in a 3-strip. 



For k = 3, the original alphabet, shown in Fig. 8, has 8 letters, but two 
pairs of them play symmetric roles. After merging the vertices b and b on the 
one hand, c and c on the other, the condensed graph, with its x, z weights, is 
shown in Fig. 10. The corresponding equations read 

L a = x (z 3 + L a + zLp + z 2 L 1 + z 2 L d + zL e ) , 
Lp = x (2 z 2 + 2 zL a + (1 + z 2 )Lf} + zL 1 + 2 zL d ) , 
L~ ( = x (2 z + 2 z 2 L a + zLp + L 7 + 2 zLf) , 
L d = x (z + z 2 L a + zLp + L d ) , 
L e = x 2 (z 2 + zL-y + L e ) , 
Lf = x 2 (zL a + Lf) , 

and the generating function of completed polygons is 

£>3(x, y) = z 3 L a + z 2 Lp + zL 1 + zL d + z 2 Lf = 



xyN(x,y) 



D(x,y) 
where 

N(x, y)=3 (x + l) 2 (1 - xf + (5 x + 2) (2 x - 1) (a: + l) 2 (x - lf y 
- (x - 1) (6a; 6 + 4a; 5 - 18 .t 4 - 6a; 3 + 11 x 2 + 8x + l) y 2 

- x (x + 1) (2 x A + 6 x 3 - 8 a; 2 + 4 x + l) y 3 



and 
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D(x,y) = (x + 1) 2 (x - l) 6 -x(l + 4 a;) (x + if (x - ify 
+ x 2 (3x 4 + Ax 3 - 6x 2 - 8a; - 3) (x - l) 2 y 2 

+ x 3 (x + 1) (x 3 + 3 x 2 - 5 x + 3) y 3 . 

By setting x = y = t, we obtain the half-pcrimetcr generating function of 
SAP in a 3-strip, 



and, by looking at the smallest pole of this series, we also obtain the (very 
weak) lower bound 1.68... on the growth constant of square lattice self- avoiding 
polygons. 

The above method has been automated by Zcilberger [56]. It is not hard 
to see that the number of states required to count polygons in a /c-strip grows 
like 3 fe , up to a power of k. This prevents one from applying this method for 
large values of k. Better bounds for growth constants may be obtained via 
the finite lattice method described in Chapter 7. A further improvement is 
obtained by looking at a cylinder rather than a strip [5]. 

2.4 q-Analogues 

By looking at the height of the rightmost column of Ferrers diagrams, we have 
described a linear construction of these polygons that proves the rationality 
of their perimeter g.f. (Fig. 3). Let us examine what happens when we try to 
keep track of the area in this construction. 

They key point is that the area increases by the width of the polygon when 
we duplicate the bottom row. (In contrast, the half-perimeter simply increases 
by 1 during this operation.) This observation gives the following functional 
equation for the complete g.f. of Ferrers diagrams: 



This is a q-analogue of the equation defining F(x, y, 1), derived from (2). This 
equation is no longer linear, but it can be solved easily by iteration: 



t 2 (-8t 9 + At 8 + 10t 7 - 20t 6 - t 5 - t 4 + It 3 + 3t 2 - 7t + 3) 



4 1 10 - 2 1 9 - 5 t s + 8 1 7 - t 6 + 2 1 5 - 4 1 4 + 2 1 3 + 3 1 2 - 4 1 + 1 



F(x, y, q) = xyq + xqF(x, y, q) + yF(xq, y, q). 



F(x,y,q) 



xyq 



+ 




1 — xq 



xyq 



+ 



jF(xq 2 7 y,q) (4) 



1 



xq 



1 — xq 1 — xq 2 1 — xq 1 — xq 




with 

(xq) = I and (xq) n = (1 - xq)(l - xq 2 ) ■ ■ ■ (1 - xq n ). 
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Similarly, for the stack polygons of Fig. 4, one obtains: 

S(x, y, q) = xyq + xqS(x, y, q) + S + (x, y, q), 
S+{x,y,q) = yS(xq,y,q) + xqS+(x,y,q). 

Eliminating the series S+ gives 

S{x, y, q) = XV<1 + - — V—^S(xq, y, q) 
1 — xq (1 — xq) 1 

= xy n q n 

4^ {xq) n -i{xq) n 

In Section 4 we present a systematic approach for counting classes of 
column-convex polygons by perimeter and area. 



3 Algebraic models and algebraic series 

3.1 A basic example: bargraphs counted by perimeter 

Let us return to bargraphs. The linear description used in Section 2.1 to count 
them by area cannot be directly recycled to count them by perimeter: indeed, 
when we add a cell at the top of the last column, how do we know if we increase 
the perimeter, or not? Instead, we are going to scan the polygon from left to 
right, and factor it into two smaller bargraphs as soon as we meet a column 
of height 1 (if any). If there is no such column, deleting the bottom row of the 
polygon leaves another bargraph. This description is schematized in Fig 11. 

Fig. 11. A second recursive construction of bargraphs. 



Let B be the set of words over the alphabet {n, s, e} that naturally encode 
the top boundary of bargraphs, from the SW to the SE corner. Fig. 11 trans- 
lates into the following recursive description, where the unions are disjoint: 

B = nCs with C = nCs U {e} U eC U nCse U nCseC. (5) 

This implies that the anisotropic perimeter g.f. of bargraphs satisfies 

B{x,y) = yL(x,y), 



L(x, y) = yL(x, y) + x + xL(x, y) + xyL(x, y) + xyL(x, y) 2 . 
These equations are readily solved and yield: 
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B{x,y) 



l-x-y-xy- - y)((l - x) 2 - y(l + x) 2 ) 
2x 



(6) 



Thus the perimeter g.f. of bargraphs is algebraic, and its algebraicity is ex- 
plained combinatorially by the recursive description of Fig. 11. 

Note that one can directly translate this description into an algebraic 
equation satisfied by B(x,y), without using the language B. This language 
is largely a convenient tool to highlight the algebraic structure of bargraphs. 
The translation of Fig. 11 into an equation proceeds as follows: there are 
two types of bargraphs, those that have at least one column of height 1, and 
the others, which we call thick bargraphs. Thick bargraphs are obtained by 
duplicating the bottom row of a general bargraph, and are thus counted by 
yB(x,y). Among bargraphs having a column of height 1, we find the single 
cell bargraph (g.f. xy), and then those of width at least 2. The latter class can 
be split into 3 disjoint classes: 

the first column has height 1: these bargraphs are obtained by adding a 
cell to the left of any general bargraph, and are thus counted by xB(x, y), 
the last column is the only column of height 1; these bargraphs are obtained 
by adding a cell to the right of a thick bargraph, and are thus counted by 
xyB(x,y), 

the first column of height 1 is neither the first column, nor the last column. 
Such bargraphs are obtained by concatenating a thick bargraph, a cell, and 
a general bargraph; they are counted by xB(x,y) 2 . 

This discussion directly results in the equation 

B(x, y) = yB(x, y) + xy + xB(x, y) + xyB(x, y) + xB(x, y) 2 . (7) 

3.2 Algebraic objects 

The above description of bargraphs involved two constructions: 

1. taking disjoint unions of sets, 

2. taking cartesian products of sets. 

For two classes Ai and A2, the element (01,02) of the product A\ x A2 is 
seen as the concatenation of the objects a\ and a-i- We will say that a class of 
objects is algebraic if it admits a non-ambiguous recursive description based 
on disjoint unions and cartesian products. It is assumed that the size of the 
objects is additive for the concatenation. For instance, (5) gives an algebraic 
description of the words of £ and B. 

In the case of linear constructions, the only concatenations that were al- 
lowed were between one object and a single atom. As we can now concatenate 
two objects, algebraic constructions generalize linear constructions. In terms 
of g.f.s, concatenating objects of two classes means taking the product of the 
corresponding g.f.s. Hence the g.f. of an algebraic class will always be the first 
component of the solution of a polynomial system of the form: 
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Ai = Pi(t,A 1} . . .,A k ) for l<i<k, 

where Pi is a polynomial with coefficients in N. Such series are called N- 
algebraic, and are closely related to the theory of context-free languages. We 
refer to [51] for details on these languages, and to [12] for a discussion of 
N-algebraic series in enumeration. 

3.3 More algebraic models 

In this section we present three problems that can be solved via an algebraic 
decomposition: staircase polygons, then column-convex polygons counted by 
perimeter (and the subclass of directed column-convex polygons) , and finally 
directed polyominoes counted by area. 

Staircase polygons by perimeter 

In Section 1.3 we defined staircase polygons through their directed and con- 
vexity properties. See Fig. 2(b) for an example. We describe here a recursive 
construction of these polygons, illustrated in Fig. 12. It is analogous to the 
construction of bargraphs described at the end of Section 3.1 and illustrated 
in Fig. 11. Denote by S(x,y) the anisotropic perimeter generating function of 
staircase polygons. 




Fig. 12. A recursive construction of staircase polygons. 

We say that a staircase polygon is thick if deleting the bottom cell of each 
column gives a staircase polygon of the same width. These thick polygons are 
obtained by duplicating the bottom cell in each column of a staircase polygon, 
so that their generating function is yS(x,y). 

Among non-thick staircase polygons, we find the single cell polygon (g.f. 
xy), and then those of width at least 2. Let P be in the latter class, and denote 
its columns Ci, . . . , Ck, from left to right. The fact that P is not thick means 
that there exist two consecutive columns, C,; and Ci+i, that overlap by one 
edge only. Let i be minimal for this property. Two cases occur: 
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the first column has height 1. In particular, i = 1. These polygons are ob- 
tained by adding a cell to the bottom left of any general staircase polygon, 
and are thus counted by xS(x, y), 
- otherwise, the columns Ci,...,C, form a thick staircase polygon, and 
Cj+i, . . . , Cfc form a general staircase polygon. Concatenating these two 
polygons in such a way that they share only one edge gives the original 
polygon P. Hence the g.f. for this case is S(x,y) 2 . 

This discussion gives the equation 

S(x,y) = yS(x,y) + xy + xS(x,y) + S{x,y) 2 

so that 



S(x, y) = — [l — x — y — y/l — 2x — 2y — 2xy + x 2 + y 2 J 

This expansion can be obtained using the Lagrange inversion formula [9] . The 
isotropic semi-perimeter g.f. is obtained by setting t = x = y: 

S(t, t) = i (1 - 2t - VI - 4i) = C n t n+1 

n>l 

where C n = ( 2 ^) /(n + 1) is the Catalan number. The same approach can 
be applied to more general classes of convex polygons, like directed-convex 
polygons and general convex polygons. See for instance [9, 23]. 

Column-convex polygons by perimeter 

We now apply a similar treatment to the perimeter enumeration of column- 
convex polygons (cc-polygons for short). Their area g.f. was found in Sec- 
tion 2.3. Let C denote the set of these polygons, and C(x,y) their anisotropic 
perimeter generating function. Our recursive construction requires us to in- 
troduce two additional classes of polygons. The first one, Ci, is the set of 
cc-polygons in which one cell of the last column is marked. The correspond- 
ing g.f. is denoted C\{x, y). Note that, by symmetry, this series also counts 
cc-polygons where one cell of the first column is marked. Then, C2 denotes the 
set of cc-polygons in which one cell of the first column is marked (say, with 
a dot), and one cell of the last column is marked as well (say, with a cross). 
The corresponding g.f. is denoted Cz(x,y). Our recursive construction of the 
polygons of C is illustrated in Fig. 13. 

We say that a cc-polygon is thick if deleting the bottom cell of each column 
gives a cc-polygon of the same width. These thick polygons are obtained by 
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Fig. 13. A recursive construction of column-convex polygons. 

duplicating the bottom cell in each column of a cc-polygon, so that their 
generating function is yC[x, y). 

Among non-thick cc-polygons, we find the single cell polygon (g.f. xy), 
and then those of width at least 2. Let P be in the latter class, and denote 
its columns C\, . . . , C/., from left to right. The fact that P is not thick means 
that there exist two consecutive columns C, and Cj+i that overlap by one 
edge only. Let i be minimal for this property. Two cases occur: 

- the first column has height 1. In particular, i = 1. These polygons are 
obtained by adding a cell to the left of any cc-polygon having a marked 
cell in its first column, next to the marked cell. They are thus counted by 
xd(x,y), 

- otherwise, the columns Ci,...,Cj form a thick cc-polygon Pi, and the 
columns Cj+x , . . . , form a general cc-polygon P^. There are several 
ways of concatenating these two polygons in such a way they share only 
one edge: 

- either the shared edge is at the bottom of Ci and at the top of Cj+i: 
such polygons are counted by C(x,y) 2 , 

- or the shared edge is at the top of Ci and at the bottom of Cj+i: such 
polygons are also counted by C(x,y) 2 , 

- if Cj+i has height at least 2, there are no other possibilities. However, 
if Ci+i consists of one cell only, this cell may be adjacent to any cell 
of Ci, not only to the bottom or top ones. The case where C^+i is the 
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last column of P is counted by xy(C\(x, y) — C(x, y)). The case where 
i + 1 < k is counted by x(Ci(x,y) 2 — C(x,y)Ci(x,y)). 

Let us drop the variables x and y in the series C, C\ and C 2 . The above 
discussion gives the equation: 

C = yC + xy + xC 1 + 2C 2 + xy(d - C) + x{C 2 - CC X ). 

The construction of Fig. 13 can now be recycled to obtain an equation for 
the series Ci, counting cc-polygons with a marked cell in the last column. 
Note that the first case of the figure (thick polygons) gives rise to two terms, 
depending on whether the marked cell is one of the duplicated cells, or not: 

d = y(C + Ci) + xy + xC 2 + 2CC X + xy(d - C) + x(dd - CC 2 ). 

We need a third equation, as three series (namely C, C\ and C 2 ) are now 
involved. There are two ways to obtain a third equation: 

- either we interpret C\ as the g.f. of cc-polygons where one cell is marked 
in the first column. The construction of Fig. 13 gives: 

d = y(C + C x ) + xy + xC x + 2(C + d)C + xy((d - C) + (C 2 - d)) 

+ x((Cl-Cd) + {dd~Cl)). 

Note that now many cases give rise to two terms in the equation. 

- or we work out an equation for C 2 using the decomposition of Fig. 13. 
Again, many of the cases schematized in this figure give rise to several 
terms. In particular, the first case (thick polygons) gives rise to 4 terms: 

C 2 = y{C+2d+C 2 )+x y +xC 2 +2(C+d)d+xy((d-C) + (C 2 -d)) 

+ x((Cid - CC 2 ) + [C\ - dd)). 

Both strategies of course give the same equation for C = C(x,y), after the 
elimination of Ci and C 2 - 

(-5xy -18 + 2xy 2 - 18 y 2 + 36 y + 2 x) C 4 

+ (y - 1) (5 xy 2 - 21 y 2 + 42 y - 14 xy + 5 x - 21) C 3 
+ 2 (y- if (-Ay 2 + 2xy 2 + 8y- 7xy~4 + 2x) C 2 
+ (y- if (xy 2 - y 2 + 2y - 6xy + x - 1) C - xy (y - if = 0. 

This quartic has 4 roots, among which the g.f. of cc-polygons can be identified 
by checking the first few coefficients. This series turns out to be unexpectedly 
simple: 



C(x,y) = (l-y) 



( \ 

2 V2 



3 V2 - Jl + x + J(l - xf - 16 
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Feretic has provided direct combinatorial explanations for this formula [28, 
29]. The algebraic equation satisfied by C(t,t) was first 3 obtained (via a 
context-free language) in [22] . The method we have used is detailed in [26] . 

Directed column-convex polygons by perimeter 

It is not hard to restrict the construction of Fig. 13 to directed cc-polygons. 
This is illustrated in Fig. 14. Note that the case where the columns Cj and 
Cj+i share the bottom edge of Cj (the fourth case in Fig. 13) is only possible 
if C\ + i has height 1. Moreover, only one additional series is needed, namely 
that of directed cc-polygons marked in the last column (Z?i). 




Fig. 14. A recursive construction of directed column-convex polygons. 

One obtains the following equations: 

D = yD + xy + xD + xD 2 + xyD + D 2 + xy(Di - D) + x{D 1 - D)D, 
L>i =y(D + D 1 ) + xy + xD x + xDD x + xyD + DD X + xy(D 1 - D) 

+x(D 1 -D)D 1 . 

Eliminating D\ gives a cubic equation for the series D = D(x,y): 

3 Eq. (32) in [22] has an error: the coefficient of t 6 c 3 in p2 should be —40 instead 
of +40. 



22 Mireille Bousquet-Melou and Richard Brak 



D 3 + 2 (y - 1) D 2 + (y - 1) {x + y - 1) D + xy (y - 1) = 0. 

This equation was first obtained in [21]. The first few terms of the semi- 
perimeter generating function are 

D(t, t)=t 2 + 2t 3 + 6t 4 + 20i 5 + 71< 6 + 263t 7 + 1005t 8 + 3933< 9 + • • • 
Directed polyominoes by area 

Let us move to a class that admits a neat, but non-obvious, algebraic structure: 
directed polyominoes counted by area. This structure was discovered when 
Viennot developed the theory of heaps [53] . Intuitively, a heap is obtained by 
dropping vertically some solid pieces, one after the other. Thus, a piece lies 
cither on the "floor" (when it is said to be minimal), or at least partially 
covers another piece. 

Directed polyominoes are, in essence, heaps. To see this, replace every cell 
of the polyomino by a dimer, after a 45 degree rotation (Fig. 15). This gives 
a heap with a unique minimal piece. Such heaps arc called pyramids. If the 
columns to the left of the minimal piece contain no dimer, we say we have a 
half-pyramid (Fig. 15, right). 




Fig. 15. Left: A directed polyomino and the associated pyramid. Right: a half- 
pyramid. 

The interest in heaps lies in the existence of a product of heaps: The product 
of two heaps is obtained by putting one heap above the other and dropping its 
pieces. Conversely, one can factor a heap by pushing upwards one or several 
pieces. See an example in Fig. 16. This product is the key in our algebraic 
description of directed polyominoes, or, equivalently, of pyramids of dimers, 
as we now explain. 

A pyramid is either a half-pyramid, or the product of a half-pyramid and 
a pyramid (Fig. 17, top). Let D(q) denote the g.f. of pyramids counted by the 
number of dimers, and H (q) denote the g.f. of half-pyramids. Then D(q) = 
H(q)(l + D(q)). 

Now, a half-pyramid can be a single dimer. If it has several dimers, it is the 
product of a single dimer and of one or two half-pyramids (Fig. 17, bottom), 
which implies H(q) = q + qH(q) + qH 2 (q). 
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Fig. 16. A factorization of a pyramid into a pyramid and a half- pyramid. Observe 
that the highest dimer of the pyramid moves up as we lift the white dimer. 




Fig. 17. Decomposition of pyramids (top) and half- pyramids (bottom). 



Note that D(q) is also the area g.f. of directed polyominoes. A straightforward 
computation gives: 

This was first proved by Dhar [24] . The above proof is adapted from [7] . 
3.4 ^-Analogues 

By looking for the first column of height 1 in a bargraph, wc have described 
an algebraic construction of these polygons (Fig. 11) that proves that their 
perimeter g.f. is algebraic (Section 3.1). Let us now examine what happens 
when we try to keep track of the area of these polygons. 

As in Section 2.4, the key observation is that the area behaves additively 
when one concatenates two bargraphs, but increases by the width of the poly- 
gon when we duplicate the bottom row. (In contrast, the half-pcrimetcr simply 
increases by 1 during this operation.) This observation gives rise to the fol- 
lowing functional equation for the complete g.f. of bargraphs: 
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B(x, y, q) = yB(xq, y, q) + xyq + xqB(x, y, q) + xyqB(xq, y, q) 

+ xqB(xq,y,q)B(x,y,q). (9) 

This is a g-analogue of Equation (7) denning B(x,y, 1). This equation is no 
longer algebraic, and it is not clear how to solve it. It has been shown in [44] 
that it can be linearized and solved using a certain Ansatz. We will show in 
Section 4.1 a more systematic way to obtain B(x, y, q), which does not require 
any Ansatz. 

4 Adding a new layer: a versatile approach 

In this section we describe a systematic construction that can be used to find 
the complete g.f. of many classes of polygons having a convexity property [10]. 
The cost of this higher generalization is twofold: 

• it is not always clear how to solve the functional equations obtained in this 
way, 

• in contrast with the constructions developed in Sections 2 and 3, this 
approach does not provide combinatorial explanations for the rational- 
ity /algcbraicity of the corresponding g.f.s. 

This type of construction is sometimes called Temperley 's approach since Tem- 
perley used it to write functional equations for the generating function of 
column-convex polygons counted by perimeter [52]. But it also occurs, in a 
more complicated form, in other "old" papers [6, 40]. We would prefer to see 
a more precise terminology, like layered approach. 

4.1 A basic example: bargraphs by perimeter and area 

We return to our favourite example of bargraphs, and we now aim to find the 
complete g.f. B(x,y,q) of this class of polygons. We have just seen that the 
algebraic description of Fig. 11 leads to the g-algebraic equation (9), which is 
not obvious to solve. The linear description of Section 2.1 cannot be directly 
exploited either: in order to decide whether the addition of a cell at the top 
of the last column increases the perimeter or not, we need to know which of 
the last two columns is higher. 

We present here a variation of this linear construction that allows us to 
count bargraphs by area and perimeter, provided we also take into account 
the right height by a new variable s. By right height, we mean the height of 
the rightmost column. The g.f. we are interested in is now 

B(x,y,q,s) = ^2B h (x,y,q)s h , 

h>l 

where Bh(x, y, q) is the complete g.f. of bargraphs of right height h. 

Our new construction is illustrated in Fig. 18. The class B of bargraphs is 
split into three disjoint subsets: 
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i u d „ tii- 1- 



Fig. 18. A third recursive construction of bargraphs. 

1. bargraphs of width 1 (columns): the g.f. of this class is xysq/(l — ysq), 

2. bargraphs in which the last column is at least as high as the next-to-last 
column. These bargraphs are obtained by duplicating the last column of a 
bargraph (which boils down to replacing s by sq in the series B(x, y, q, s)), 
and adding a (possibly empty) column at the top of the newly created 
column. The corresponding g.f. is thus 

x 

B{x,y,q,sq). 



1 - ysq 

3. bargraphs in which the last column is lower than the next-to-last column. 
To obtain these, we start from a bargraph, say of right height h, and add 
a new column of height I < h to the right. The g.f. of this third class is: 

Bh{x,y,q) 



x B h{x, V, q) ^{sqf = x ^ 

h>l \ t=l ) h>l 



1 — sq 

sqB(x, y, q, 1) - B(x, y, q, sq) 



(10) 



1 — sq 

Writing B(s) = B(x, y, q, s), and putting together the three cases, we obtain: 

B(s) = + JOH-BiD + n Xa «»-V B(sq). (11) 

I- ysq 1-sq (1 - sq)(l - ysq) 

This equation is solved in two steps: first, an iteration, similar to what we did 
for Ferrers diagrams in (4) (Section 2.4), provides an expression for B(s) in 
terms of B(l): 

= {xs{y- l))"~ 1 g(S) / X ysq n + xsq n 
n>i ( s 9)«-i(y s, ?)«-i \ 1 -V s< l n l-sq n 

Then, one sets s = 1 to obtain the complete g.f. B(l) = B(x,y,q,l) of 
bargraphs: 

B(x,y,q,l) = T ^j- (12) 

with 

_ x n (y - l) n - 1 q('^ 1 ) ^ _ x n (y - l)"- 1 q("^ 1 ) 

+ n>l (Q)n-l(yQ)n ^ {q)n(yq)n-l 
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4.2 More examples 

In this section, we describe how to apply the layered approach to two other 
classes of polygons: staircase and column-convex polygons, counted by perime- 
ters and area simultaneously. In passing we show how the difference of g.f.s 
resulting from a geometric summation like (10) can be explained combinato- 
rially by an inclusion-exclusion argument. 

Staircase polygons 

As with the bargraph example above, we define an extended generating func- 
tion which tracks the height of the rightmost column of the staircase polygon, 

S(x, y, q,s) = s h(x, y, q)s h , 

h>l 

where Sh(x,y,q) is the generating function of staircase polygons with right 
height h. The set of all staircase polygons can be partitioned into two parts 
(Fig. 19): 

1. those which have only one column: the g.f. for this class is xyqsj (1 — yqs), 

2. those which have more than one column: their g.f. is obtained as the 
difference of the g.f. of two sets as follows. 

Staircase polygons of width £ > 2 can be split into two objects: a staircase 
polygon formed of the i — 1 first columns, and the rightmost column. The left 
part has generating function S(x,y,q,l) (ignoring the rightmost height), to 
which we then attach a column of cells. The attached column is constrained 
in that it must not extend below the bottom of the rightmost column of the 
left part. It is generated (see Fig. 19) by gluing a descending column (with g.f. 




Fig. 19. The two types of staircase polygons. 

1/(1 — qs)) and an ascending column (with g.f. 1/(1 — yqs)) to a single square 
(with g.f. xyqs). The single square is required to ensure that the column is not 
empty and is glued to the immediate right of the topmost square of the left 
part. An important observation is that only the ascending column contributes 
in the increase of the vertical perimeter. This gives the generating function 
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S(x,y,q, 1) ■ xyqs ■ . 

1 — qs 1 — yqs 

This construction however results in configurations which might have the 
rightmost column extending below the rightmost column of the left part. We 
must thus subtract the contribution of these "bad" configurations from the 
above g.f. We claim that they are generated by 

S(x, y, q, sq) ■ xyqs ■ — - — . 

1 — qs 1 — yqs 

The replacement of s with sq in S(x, y 7 q, sq) is interpreted as adding a copy 
of the last column of the left part, as illustrated in Fig. 20. The xyqs factor 
is interpreted as attaching a new cell to the bottom of the duplicated column 
(thus ensuring the rightmost column is strictly below the rightmost column 
of the left part). Finally, we add a descending and an ascending column. 
Again, the height of the latter must not be taken into account in the vertical 
perimeter. 



a) 



S(x.y. ■/..<) 



S{x,y,q,sq) 



b) 













J 






S h 














_J 







i 



1 - yqs 



S(x,y,q,sq) 
}. xyqs 



l-qs 



Fig. 20. a) Replacing s by sq in S(x,y,q,s) duplicates the last column of the 
polygon, b) Generating function of "bad" configurations. 



Thus the final equation for the generating function is 

b(x,y,q,s) = h [b{x,y,q,l) - b{x,y,q,sq))- — -. 

I -yqs (1 -qs){\ -yqs) 

It can also be obtained via geometric sums, as was done for (10). The equation 
is solved with the same two step process as for bargraphs. First we iterate it 
to obtain S(x, y, q, s) in terms of S(x, y, q, 1), and then we set s to 1, obtaining 

S(x,y,q,l) = y^-, 
Jo 

where Jq and J\ are two q— Bessel functions [34]: 
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■h{x,y,q) = 



fc>l 



k ( k+1 \ 
fc+i x V 2 ' 

(q)k-i(yq)k 



and 



J Q (x,y,q) = 



(q)k(yq) 




Note, the appropriate limit as q — > 1 leads to standard Besscl functions which 
are related to the generating function for semi-continuous staircase polygons 
see [18] for details. 

Column-convex polygons 

The case of column-convex polygons is more complex and we will not give 
all the details but only discuss the primary additional complication. We refer 
to [10] for a complete solution. Like for staircase polygons, a functional equa- 
tion for column-convex polygons can be obtained by considering the rightmost 
(last) column. The position of the last column compared with the second last 
column must be carefully considered. Again there are several cases depending 
on whether the top (resp. bottom) of the last column is strictly above, at the 
same level or below the top (resp. bottom) of the second-last column. The 
case which leads to a type of term that does not appear in the equation for 
staircase polygons is the case where the top (resp. bottom) of the last col- 
umn cannot be above (resp. below) the top (resp. bottom) of the second-last 
column. Thus we will only explain this case which we will refer to as the 
contained the last column is somehow contained in the previous one. 

If the generating function for the column-convex polygons is C(s) = 
C(x,y,q,s) then we claim that the polygons falling into the contained case 
are counted by the generating function 



Thus we see we now need a derivative of the generating function. As a polygon 
of right height h contributes h times to the series dC/ds(l), this series counts 
polygons with a marked cell in the rightmost column. 

Let us now explain this expression, which is illustrated in Fig. 21. We 
consider a polygon as the concatenation of a left part with a new (rightmost) 
column C. In the left part, we mark the cell of the rightmost column that is at 
the same level as the bottom cell of C. So, starting from a marked polygon, we 
first add a single square to the right of the marked cell - this gives a factor xsq. 
Above this square we then add an ascending column which is generated by 
1/(1 — sq). However, as with the staircase polygons, the resulting series counts 
"bad" configurations, where the last column ends strictly higher than the 
second last column. We subtract the contribution of these bad configurations 



xsq dC 



(1) 




(C(l)-C(sq)). 



(13) 



1 — sq ds 



(1 - sq)* 
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Fig. 21. A schematic representation of the equation for the case where the rightmost 
column does not extend above or below the second-last column. 



by generating them as shown on the second picture of Fig. 21. This results in 
subtracting the term xq 2 s 2 C '(1) / '(1 — sq) 2 . However, we have now subtracted 
too much! Indeed, some configurations counted by the latter series have a 
rightmost column that ends below the second last column. We correct this by 
adding the contribution of these configurations, which is xq 2 s 2 C(sq) /(l — sq) 2 
(Fig. 21, right). This establishes (13) for the g.f. of the contained case. 

The other cases are simpler, and in the same vein as what was needed for 
staircase polygons. Considering all cases gives 



C(s) 



xsq 



dC 



xsyq 



1 — syq 1 — sq ds 



(1) 



xs 2 q 2 (2y — syq — 1) 
(1 - sq) 2 (l - syq) 



C(l) 



xs 2 q 2 (l — y) 2 
(1 - sq) 2 {l - syq} 



: C(sq). (14) 



In order to solve this equation, we first iterate it to obtain C(s) in terms of 
C(l) and C"(l) = dC/ds(l). Setting s = 1 gives a linear equation between 
C(l) and C"(l). Setting s — 1 after having differentiated with respect to s 
gives a second linear equation between C(l) and C"(l). We end up solving a 
linear system of size 2, and obtain C(l) as a ratio of two 2x2 determinants. 
The products of series that appear in these determinants can be simplified, 
and the final expression reads 



C(x,y,q, 1) = y 



1 + W + yX 



where 



X 



xq 



(l-y)(l-yq) 



E 



(-l)"+ 1 x"(l~ 2/ ) 2 "- 4 g (^ 1 )(t/ 2 g ) 2ra . 

(g)n-i (m)n-2 {yq)l-i (yq) n {y 2 q)n- 



and 
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w _ y (-l)"x"(l - yf^£*%? t qhn-i 
n>i (<?)« {yq)l-i (yq) n (y 2 q) n -i 

The first solution, involving a more complicated expression, was given in [17]. 
The one above appears as Theorem 4.8 in [10]. 

4.3 The kernel method 

In Sections 2 and 3, we have explained combinatorially why the area g.f. 
of bargraphs, B(l,l,q), and the perimeter g.f. of bargraphs, B(x, y, 1), arc 
respectively rational and algebraic. It is natural to examine whether these 
properties can be recovered from the construction of Fig. 18 and the functional 
equation (11). 

As soon as we set y = 1 in this equation, the main difficulty, that is, 
the term B(sq), disappears. We can then substitute 1 for s and solve for 
B{x 1 1, q, 1), the width and area g.f. of bargraphs. This series is found to be 

B{x,l,q,l) = 



1 — q — xq 



From this, one also obtains a rational expression for the series B(x, 1, q, s). 
The rationality of B(x, l,q, 1) also follows directly from the expression (12): 
setting y = 1 shrinks the series 2+ and I- to simple rational functions. 

How the perimeter g.f. of bargraphs can be derived from the functional 
equation (11) is a more challenging question. Setting q = 1 gives 

*(-) = + -ff-fl(l) + X *%-V B (s). (15) 

1 — ys 1 — s (1 — s)(l — ys) 

This equation cannot be simply solved by setting s = 1. Instead, the solution 
uses the so-called kernel method, which has proved useful in a rather large 
variety of enumerativc problems in the past 10 years [3, 4, 14, 19, 32, 47]. 
This method solves, in a systematic way, equations of the form: 

K(s, x)A(s, x) = P(x, s, A-l(x), . . . , A k {x)) 

where K(s,x) is a polynomial in s and the other indeterminates x = 
{x\, . . . ,£„), P is a polynomial, A(s,x) is an unknown series in s and the 
s^s, while the series Ai(x) only depend on the Xj's. (It is assumed that the 
equation uniquely defines all these unknown series.) We refer to [14] for a 
general presentation, and simply illustrate the method on (15). We group the 
terms involving B(s), and multiply the equation by (1 — s) to obtain: 

- » ^L^l) B (s) = Xys{1 S) + xsB(l). (16) 
1 - ys J 1 - ys 



Let S = S(x, y) be the only formal power series in x and y that satisfies 
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S 



1 



*S(y - 1) 
1-yS 



That is, 



S = 



1 — x + y + xy 



^(l-y)((l-x) 2 -y(l + x) 2 ) 



2y 



Replacing s by S in (16) gives an identity between series in x and y. By 
construction, the left-hand side of this identity vanishes. This gives 



and we have recovered the algebraic expression (6) of the perimeter g.f. of 
bargraphs. 

5 Some open questions 

We conclude this chapter with a list of open questions. As mentioned in the 
introduction, the combination of convexity and direction conditions gives rise 
to 35 classes of polyominoes, not all solved. But all these classes are certainly 
not equally interesting. The few problems we present below have two impor- 
tant qualities: they do not seem completely out of reach (we do not ask about 
the enumeration of all polyominoes) and they have some special interest: they 
deal either with large classes of polyominoes, or with mysterious classes (that 
have been solved in a non-combinatorial fashion), or they seem to lie just at 
the border of what the available techniques can achieve at the moment. 

5.1 The quasi-largest class of quasi-solved polyominoes 

Let us recall that the growth constant of n-cell polyominoes is conjectured 
to be a bit more than 4. More precisely, it is believed that p n , the number 
of such polyominoes, is equivalent to up to a multiplicative constant, 

with \i — 4.06... [37]. The techniques that provide lower bounds on fi involve 
looking at bounded polyominoes (for instance polyominoes lying in a strip of 
fixed height k) and a concatenation argument. See [5] for a recent survey and 
the best published lower bound, 3.98.... It is not hard to see that for k fixed, 
these bounded polyominoes have a linear structure, and a rational generating 
function. This series is obtained either by adding recursively a whole "layer" 
to the polyomino (as we did for self-avoiding polygons in Section 2.3), or 
by adding one cell at a time. The latter approach is usually more efficient 
(Chapter 6). 



B(l) = B(x,y,l,l) 



y(s-i) 
i-yS 



l-x-y-xy-y/(l- y)((l - x) 2 - y(l + x) 2 ) 



2x 
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What about solved classes of polyominoes that do not depend on a param- 
eter k, and often have a more subtle structure? We have seen in Section 3.3 
that the g.f. of directed polyominoes is algebraic, with growth constant 3. 
This is "beaten" by the growth constant 3.20... derived from the rational 
g.f. of column-convex polyominoes (Section 2.3). A generalization of directed 
polyominoes (called multi-directed polyominoes) was introduced in [15] and 
proved to have a fairly complicated g.f., with growth constant about 3.58. To 
our knowledge, this is the largest growth constant reached from exact enu- 
meration (again, apart from the rational classes obtained by bounding column 
heights). 

However, in 1967, Klarner introduced a "large" class of polyominoes that 
seems interesting and would warrant a better understanding [39]. His defi- 
nition is a bit unclear, and his solution is only partial, but the estimate he 
obtains of the growth constant is definitely appealing: about 3.72. Let us 
mention that the triangular lattice version of this mysterious class is solved 
in [15]. The growth constant is found to be about 4.58 (the growth constant 
of triangular lattice animals is estimated to be about 5.18, see [54]). 

5.2 Partially directed polyominoes 

This is another generalization of directed polyominoes, with a very natural 
definition: the corresponding animal A contains a source point vq from which 
every other point can be reached by a path formed of North, East and West 
steps, only visiting points of A (Fig. 22(a)). This model has a slight flavour 
of heaps of pieces, a notion that has already proved useful in the solution 
of several polyomino models (see Section 3.3 and [7, 16, 15]). The growth 
constant is estimated to be around 3.6, and, if proved, would thus improve 
that of multi-directed animals [45]. 




Fig. 22. (a) A partially directed animal. The source can be any point on the bottom 
row. (b) A directed animal on the square lattice, with the right neighbours indicated 
in white, (c) A directed animal A on the triangular lattice. The distinguished points 
are those having (only) their South neighbour in A. 
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5.3 The right site-perimeter of directed animals 

We wrote in the introduction that almost all solved classes of polyominoes 
can be solved by one of the three main approaches we present in this chapter. 
Here is one simple-looking result that we do not know how to prove via these 
approaches (nor via any combinatorial approach, to be honest). 

Take a directed animal A, and call a neighbour of A any point that does 
not lie in A, but could be added to A to form a new directed animal. The 
number of neighbours is the site-perimeter of A. The right site-perimeter of A 
is the number of neighbours that lie one step to the right of a point of A. It 
was proved in [11] that the g.f of directed animals, counted by area and right 
site-perimeter, is a very simple extension of (8): 



n , v X (1 + q)(l + q - qx) \ 

D(q > X)= 2{i l- q (2 + X )+^l-x) - l ) ' 

The proof is based on an equivalence with a one-dimensional gas model, in- 
spired by [25] . It is easy to see that the right site-perimeter is also the number 
of vertices v of A whose West neighbour is not in A. (By the West neighbour, 
we mean the point at coordinates (i — 1, j) if v = 

Described in these terms, this result has a remarkable counterpart for 
triangular lattice animals (Fig. 22). Let us say that a point of the animal 
has a West (resp. South, South- West) neighbour in A if the point (i — 
(resp. (i, j — 1), (i — 1, j — 1)) is also in A. Then the g.f. that counts these 
animals by the area and the number of points having a SW-neighbour (but 
no W- nor S-neighbour) is easy to obtain using heaps of dimers and the ideas 
presented in Section 3.3: 



D(q,x) = - 



2 y V 1 — 3q — qx 

What is less easy, and is so far only proved via a correspondence with a gas 
model, is that D(q, x) also counts directed animals (on the triangular lattice) 
by the area and the number of points having a South neighbour (but no SW- 
nor W-neighbour). Any combinatorial proof of this result would give a better 
understanding of these objects. One possible starting point may be found in 
the recent paper [41], which sheds some combinatorial light on the gas models 
involved in the proof of the above identities. 



5.4 Diagonally convex polyominoes 

Let us conclude with a problem that seems to lie at the border of the ap- 
plicability of the third approach presented here (the layered approach). In 
the enumeration of, say, column-convex polyominoes (Section 4.2), we have 
used the fact that deleting the last column of such a polyomino gives another 
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column-convex polyomino. This is no longer true of a G?_-convex polyomino 
from which we would delete the last diagonal (Fig. 1(d)). Still, it seems that 
this class is sufficiently well structured to be exactly enumerable. Note that 
this difficulty vanishes when studying the restricted class of directed diagonally 
convex polyominoes [8, 46], which behave approximately like column-convex 
polyominocs. 
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